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The ability to calculate precise likelihood ratios is fundamental to many STEM areas, such as 
decision-making theory, biomedical science, and engineering. However, there is no assumption-free 
statistical methodology to achieve this. For instance, in the absence of data relating to covari¬ 
ate overlap, the widely used Bayes’ theorem either defaults to the marginal probability driven 
“naive Bayes’ classifier”, or requires the use of compensatory expectation-maximization techniques. 
Equally, the use of alternative statistical approaches, such as multivariate logistic regression, may 
be confounded by other axiomatic conditions, e.g., low levels of co-linearity. This article takes an 
information-theoretic approach in developing a new statistical formula for the calculation of likeli¬ 
hood ratios based on the principles of quantum entanglement. In doing so, it is argued that this 
quantum approach demonstrates: that the likelihood ratio is a real quality of statistical systems; 
that the naive Bayes’ classifier is a special case of a more general quantum mechanical expression; 
and that only a quantum mechanical approach can overcome the axiomatic limitations of classical 
statistics. 


In recent years, Bayesian statistical research has often 
been epistemologically driven, guided by de Finetti’s 
famous quote that “probability does not exist” [lj. For 
example, the “quantum Bayesian” methodology of Caves, 
Fuchs, & SchackQ has applied de Finetti’s ideas to 
Bayes’ theorem for use in quantum mechanics. In doing 
so, Caves et al. have argued that statistical systems 
are best interpreted by methods in which the Bayesian 
likelihood ratio is seen to be both external to the system 
and subjectively imposed on it by the observerQ. 

From a decision-making perspective the Caves et 
al. approach is problematic. Bayes’ theorem and, 
in particular, the “naive Bayes’ classifier” have been 
used extensively to interpret information systems and 
develop normative decision-making models [4|. While 
subjectivity may play a role in a descriptive model of 
human decision-making, its use in normative analysis 
could suggest the presence of a cognitive “homonculus” 
with the power to influence decision outcomes. Yet 
at a human scale, for instance, an observer’s belief as 
to the chances of a fair coin landing either “heads” or 
“tails” has no known effect. Rather, within normative 
decision-making theory, the “heads Tails” likelihood 
ratio of 0.5:0.5 is only meaningful when considered as 
a property of the coin’s own internal statistical system 
rather than as some ephemeral and arbitrary qualia. 
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However, there are axiomatic difficulties associated 
with Bayes’ theorem, notably its reliance upon the use 
of marginal probabilities in the absence of structural sta¬ 
tistical information, such as estimates of covariate over¬ 
lap. This has led some researchers to attempt to recon¬ 
ceptualise psychology and decision-making theory using 
quantum mechanics [g, [fi] - an approach with intuitive 
merit given that both disciplines apply statistical axioms 
to analyse and interpret probabilistic systems. Yet, de¬ 
spite progress made in this area, the lack of an orthodox 
Copenhagen-based theoretical counterpoint to Caves et 
al. has impeded the development of new normative mod¬ 
els. It is this knowledge gap which this article aims to 
fill. 


I. THE LIMITS OF BAYES’ THEOREM 


Bayes’ theorem is used to calculate the conditional 
probability of a statement, or hypothesis, being true 
given that other information is also true. It is usually 
written as 


P(H t \D) 


P(Hi)P{D\Hi) 


(i) 


Here, P(Hi\D) is the conditional probability of hypoth¬ 
esis Hi being true given that the information D is true; 
P{D\Hi) is the conditional probability of D being true if 
Hi is true; and P{Hj)P{D\Hj) is the sum of the prob- 
o 

abilities of all hypotheses multiplied by the conditional 
probability of D being true for each hypothesis 0j. 
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Street A (Hi) 

Street B (H 2 ) 

Number of houses (n) 

10 

10 

% blue front door (D) 

0.8 

0.7 

% not blue front door ( D ) 

0.2 

0.3 


( 2 ) 


To exemplify using the contingency information in © , 
if one wishes to calculate which of two streets is most 
likely to have a house with a blue front door, then using 
Bayes’ theorem it is trivial to calculate that Street A 
is most likely with a likelihood ratio of approximately 
0.53:0.47, 


p {Hi\ D ) (o.5 x 0.8) + (0.5 x 0.7) 
P(H 2 \D) = 1-P(H 1 \D) = ^ 


« 0.533 , 

« 0.467 , 
( 3 ) 


where P(H;) = 10/(10 + 10) = 0.5 for both i = 1,2. 


However, difficulties arise in the use of Bayes’ theorem 
for the calculation of likelihood ratios where there are 
multiple non-exclusive data sets. For instance, if the in¬ 
formation in © is expanded to include data about the 
number of houses with a garage © then the precise co¬ 
variate overlap (i.e., D\ (ID 2 ) for each street becomes an 
unknown. 



Street A (Hi) 

Street B (H 2 ) 

Number of houses ( n ) 

10 

10 

% blue front door (D±) 

0.8 

0.7 

% with garage (D 2 ) 

0.6 

0.5 


( 4 ) 


All that may be shown is that, for each street, the num¬ 
ber of houses with both features forms a range described 
by © , where n[Hi ) is the total number of houses in street 
i, n(Di\Hi ) is the total number of houses in street i with 
a blue front door, and n(D 2 \Hi ) is the total number of 
houses in street i with a garage, 

n(D 1 rD 2 \H i ) e 

( [n{D!\Hi) + n(D 2 \Hi) - n(Hi) ,... ,mm(n(D 1 \H i ),n{D 2 \H i ))^ 
if n(D!\Hi) + n(D 2 \Hi) > n(ip) , or 
[0 ,..., min(n(Di\Hi),n(D2\Hi))^ 

\in{D 1 \H i ) + n{D 2 \H i )<n{H i ) . 


Specifically for © these ranges equate to 

n(Di fl D 2 \Hi) e {4, 5, 6} , 

n(Di fl D 2 \H 2 ) £ {2, 3, 4, 5} . (6) 


The simplest approach to resolving this problem is 
to naively ignore any intersection, or co-dependence, 
of the data and to directly multiply the marginal 
probabilities [zHl, f° r example]. Hence, given ©, the like¬ 
lihood of Street A having the greatest number of houses 
with both a blue front door and a garage would be cal¬ 
culated as 


P(Hi|P>in£> 2 ) 


0.5 x 0.8 x 0.6 

(0.5 x 0.8 x 0.6) + (0.5 x 0.7 x 0.5) 
0.578 . (7) 


Yet, because the data intersect, this probability value is 
only one of a number which may be reasonably calcu¬ 
lated. Alternatives include calculating a likelihood ratio 
using the mean value /r of the frequency ranges for each 
hypothesis 

P(»[n(Di n D 2 \Hi)}) = -L x 1(4 + 5 + 6) = 0.5 , 

P(p[n(Dx n D 2 \H 2 )}) = x 1(2 + 3 + 4 + 5) = 0.35 
=> P(Hi 1/j.Di (~l D 2 ) « 0.588 ; 

( 8 ) 

and taking the mean value of the probability range de¬ 
rived from the frequency range 

mmP(H 1 \D 1 (1 D 2 ) = —, 

4 + 5 

maxP(7J 1 |D 1 nP 2 ) = 7 —^ 

6 + 2 

=> |£>i n D 2 )} » 0.597 . (9) 

Given this multiplicity of probability values, it would 
seem that none of these methods may lay claim to norma- 
tivity. This problem of covariate overlap has, of course, 
been previously addressed within statistical literature. 
For instance, the “maximum likelihood” approach, of 
Dempster, Laird, & Rubin (Tol| , has demonstrated how 
an “expectation-maximization” algorithm may be used 
to derive appropriate covariate overlap measures. Indeed, 
the mathematical efficacy of this technique has been con¬ 
firmed by Wu(TTl|. However, from a psychological stand¬ 
point, it is difficult to see how such an iterative methodol¬ 
ogy could be employed in human decision-making, given 
mankind’s reliance on heuristics rather than cognitively 
expensive exactitudes[l2]. Since there is also little evi¬ 
dence that the naive Bayes’ classifier forms any part of 
the human decision-making process the theoretical ad¬ 
vancement of the psychology of decision-making demands 
a mathematical approach in which covariate overlaps can 
be automatically, and directly, calculated from contin- 


( 5 ) 
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gency data. 


II. A QUANTUM MECHANICAL PROOF OF 
BAYES’ THEOREM FOR INDEPENDENT DATA 


Previously unconsidered, the quantum mechanical von 
Neumann axioms would seem to offer the most promise 
in this regard, since the re-conceptualization of covariate 
data as a quantum entangled system allows for statisti¬ 
cal analysis with few, non-arbitrary assumptions. Unfor¬ 
tunately there are many conceptual difficulties that can 
arise here. For instance, a Dirac notation representation 
of Q as a standard quantum superposition is 



In this example, m cannot be solved since the possible 
values of D\ fl D 2 for each hypothesis © have been 
described as equal chance outcomes within a general su¬ 
perposition of H 1 and H 2 , with the unknown coefficients 
a and f3 assuming the role of the classical Bayesian 
likelihood ratio. 


The development of an alternative quantum mechani¬ 
cal description necessitates a return to the simplest form 
of Bayes’ theorem using the case of exclusive populations 
Hi and data sets D, D, such as given in ©. Here, the 
overall probability of Hi may be simply calculated as 


Wi) 


n(Hl) 

n(-Hi) + n(H 2 ) 


( 11 ) 


The a priori uncertainty in (|2l) may be expressed by con¬ 
structing a wave function in which the four data points 
are encoded as a linear superposition 


|>F) =«i,i |Hi ® D) + a h2 \H X 0 D) 

+ 02,1 \H2 0 D) + CX2 t 2 \H2 0 -D) ■ (12) 


Since there is no overlap between either D and D or the 
populations H\ and H 2 , each datum automatically forms 
an eigenstate basis with the orthonormal conditions 

(Hi ® D\Hi ®D) = (Hi ® D\Hi ® D) = 1 
(H 2 0 D\H 2 0D) = (H 2 0 D\H 2 0D) = 1 

all other bra-kets = 0 , (13) 


where the normalization of the wave function demands 
that 


(T|T} = 1, (14) 


aij gives a total probability of 1 

|«1,11 2 + |«1, l\ 2 + 1^2,l| 2 + |«2,21 2 = 1 ■ (15) 

For simplicity let 

xi=P(D\H 1 \ yi=P{D\Hi) , 

£2 = P(D\H 2 ), V 2 = P(D\H 2 ) , 

X\ = P(Hi), X 2 = P{H 2 ) . (16) 

If the coefficients a* j from (fT2l) are set as required by 
©, it follows that 

|c*l,l|“ = Xi, 1 0 : 1,2 | 2 = 3/1J I Ck2,l | 2 = X 2 , 10^2,21 2 = V2 ; 

(17) 

so that the normalised wave function |T) is described by 

|T) = -^=(yiI|Hi 0 D) + y/yl\Hi 0 D) 

+y/x^\H 2 0 D) + ^\H 2 0 £>)) (18) 

for some normalization constant N. 


The orthonormality condition (flTl) implies that 

N = xi + yi + x 2 + y 2 = Xi + X 2 , (19) 

thereby giving the full wave function description 

® D ) + i/yi\Hi ® D) + ^/X2 \H2 ® D) + y/i/2 \h 2 ® D ) 
“ x/Ai + X 2 

( 20 ) 

If the value of P(H\\D) is to be calculated, i.e., the prop¬ 
erty D is observed, then the normalized wave function 
m necessarily collapses to 

|T') = ai |Hi ® Di) + a 2 \H 2 0 Di) , (21) 

where the coefficients ai >2 may be determined by pro¬ 
jecting |T) on to the two terms in |T') using (fl3l) . giving 


ai = (T'|Hi®D) = 


Xi 


X 1 + X 2 ’ 


a2 = <T'| H 2 0D) = • (22) 

Normalizing m with the coefficient N' 

IU - 7W(\^T, lH '® D) + \lxr+T, |Hj ® D >) ■ 

(23) 


so that the sum of the modulus squares of the coefficients 
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and using the normalization condition (1141) , implies that 


i = /*'!*') = JL f + 3:2 ) 

' 1 ' N'\X 1 +X 2 X 1 +X 2 ) 


-A N' = 


*1 + X2 

Xi + x 2 ' 


(24) 


Thus, after collapse, the properly normalized wave func¬ 
tion (fTill becomes 


14-') = 


*1 


*1 + X2 


|A ® D) 


*2 


X\ + X 2 


\H 2 ® D) 


(25) 

which means that the probability of observing \H\ ® D) 
is 


P(\H 1 ®D)) = 


Xl 


a\ 


Xi 


Xi + X2 


Xi + x 2 


(26) 


This is entirely consistent with Bayes’ theorem and 
demonstrates its derivation using quantum mechanical 
axioms. 


III. A QUANTUM MECHANICAL 
EXPRESSION TO CALCULATE LIKELIHOOD 
RATIOS, WITH CO-DEPENDENT DATA 


Having established the principle of using a quantum 
mechanical approach for the calculation of simple likeli¬ 
hood ratios with mutually exclusive data © , it is now 
possible to consider the general case of n hypotheses and 
m data (Ell), where the data are co-dependent, or inter¬ 
sect. 



Hi 

h 2 


H n 

A 

Xl,l 

*1,2 


%l,n 

A 

x 2 ,l 

*2,2 


%2,n 






D m 

%m,l 

2 


%m,n 


(27) 


Here the contingency table in (1271) has been indexed 
using 

Xi, a > ol = 1,2,..., n; i = 1,2,..., m . (28) 

While the general wave function remains the same as be¬ 
fore, the overlapping data create non-orthonormal inner 


products which can be naturally defined as 

(H a ® Di\Hf> ® Dj) = c“ <5 a/3 , c“ = c£ € R , eg = 1 , 

(29) 

Assuming, for simplicity, that the overlaps c“- are real, 
then there is a symmetry in that c“ = c“ ( for each a. 
Further, for each a and i, the state is normalized, i.e., 
eg = 1. The given independence of the hypotheses H a 
also enforces the Kroenecker delta function, 5 a p. 


The Hilbert space V spanned by the kets \H a ®Di) 
is run-dimensional and, because of the independence of 
H a , naturally decomposes into the direct sum (I5D1) with 
respect to the inner product, thereby demonstrating that 
the non-orthonormal conditions are the direct sum of m 
vector spaces V a : 


V = Span({| H a ® A)}) = 0 V a , dim V a = m . 

a=l 

( 3 °) 

Since the inner products are non-orthonormal, each 
V a must be individually orthonormalised. Given that 
V splits into a direct sum, this may be achieved 
for each subspace V a by applying the Gram-Schmidt 
algorithm[l~3l| to {| H a ® A)} of V. Consequently, the 
orthonormal basis may be defined as 

n 

\K?) = E A tk I H a ® D k ) , < A Q I K?) = Sij , 

k =1 

(31) 

for each a = 1,2,... ,n with m x m matrices Af k , for 
each a. 

Substituting the inner products E51) gives 

m 

A % A %'Ckk'= 6ij Va = l,2(32) 

fc,fc'= l 

The wave-function may now be written as a linear com¬ 
bination of the orthonormalised kets j/\ “) with the co¬ 
efficients bf, and may be expanded into the \H a eg) A) 
basis using (ED, i.e., 

I*> = E b ? \ K i) = E h i A ik W a ® D k ) . (33) 

a.,i ot.,i,k 

As with (fT71) from earlier, the coefficients in (1551) should 
be set as required by the contingency table 

(34) 

i 

where, to solve for the 6-coefficients, (1551) may be used to 
invert 

E E b i A ?k A jk' c kk , = Vx^Af k , c%, k , (35) 

k,k' i k,k' 
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giving 

bj = V^ A< jk' c kk' • (36) 

k,k' 

Having relabelled the indices as necessary, a back- 
substitution of ( 1511 ) into the expansion ( 1551 ) gives 


c“ = 1 and c“- = c“-, the general expression (15(1 may be 
written as 

J2_ 

E y/ x i,l x i,l c ij 

P(H 1 \D 1 nD 2 ) = - 

E E s/XiaXjaCfj 

i,j = l CK= 1 


W = b i A i,k \ H a ® Ac) = ^ y^Xto |A* ® A) 1 

cn.,i,k a,k 

(37) 

which is the same as having simply assigned each ket’s 
coefficient to the square root of its associated entry in 
the contingency table. 

The normalization factor for |U/) is simply 1/y/N, 
where N is the sum of the squares of the coefficients b of 
the orthonormalised bases |iF“), 


n = 5 >n 2 = E (E v^ A k'Ak' 

i,a. \k,k f 


= '/xk 


a Xk'aC-kk' ■ 


k.k r 


Thus, the final normalized wave function is 


(38) 


l*> = 


yjXka | H a ® Hk ) 

a.h 


)* ! yjXiaXjfy Cj 

t,j,a 


(39) 


where a is summed from 1 to n, and i,j are summed 
from 1 to 77i. Note that, in the denominator, the 
diagonal term yx,; a Xj a c“-, which occurs whenever i = j, 
simplifies to Xi a since = 1 for all a. 


From (1391) it follows that, exactly in parallel to the 
non-intersecting case, if all properties Di are observed 
simultaneously, the probability of any hypothesis H a , for 
a fixed a, is 


\J X 1, l c l,l + \J X 2, l c 2,2 + V X1 ’ 1X2 ’ 1C 1,2 + V X2 ’ 1X1 ’ 1C 2,1 

~~2_ ; ! 

E J x l,a c l,l + yj x 2,a C 2,2 + V Xl ’ aX2 ’ aC l,2 + V X2 , aXl > aC 2,l 


xi + yi + 2ciWxiyi 


XI + X2 + yi + J/2 + 2c\yjx\y\ + 2c 2y /X2y2 


where, adhering to the earlier notation (usd, 


(41) 


xi = Xi,i = P(D 1 \H 1 ), yi = x 2 ,i = P{D 2 \Hi) , 
x 2 = Xi j2 = P{Di\H 2 ), 2/2 = X 2 ,2 = P(D 2 \H 2 ) , 

X! = P{Hi), X 2 = P(H 2 ) , (42) 

and, for brevity, Ci := c\ 2 , c 2 := 2 • For simplicity, 

P(Hi\Di D D 2 ) will henceforth be denoted as Pi. Imple¬ 
menting (1411) is dependent upon deriving solutions for the 
yet unknown expressions Cj, i = 1,2 which govern the ex¬ 
tent of the intersection in (l29l) . This can only be achieved 
by imposing reasonable constraints upon Cj which have 
been inferred from expected behaviour and known out¬ 
comes, i.e., through the use of boundary values and sym¬ 
metries. Specifically, these constraints are: 

Data dependence.: The expressions Ci must, in some 
way, be dependent upon the data given in the con¬ 
tingency table, i.e., 

ci = Ci(x\,yi, x 2 , y 2 \Xi, X 2 ) , 

c 2 = c 2 (xi,y 1 ,x 2 ,y 2 \Xi,X 2 ) . (43) 

Probability.: The calculated values for Pi must fall be¬ 
tween 0 and 1. Since ay and yi are positive, it 
suffices to take 


Ub?) 2 

P{H a \D! nfi 2 ...n D m ) = ‘ 

L(°“) 


E y/ x ia%jaCfj 


^ ] ■\J‘ x ioi. x i(yC, 

i,j,a 


(40) 


- 1 < a(x 1 , 2 / 1 , x 2 , 2 / 2 ) < 1 . (44) 

Complementarity.: The law of total probability dic¬ 
tates that 


In the case of non-even populations for each hypothesis 
(i.e., non-even priors), each element within (15U1) should 
be appropriately weighted. 

IV. EXAMPLE SOLUTION 

Returning to the problem presented in the contingency 
table Q, it is now possible to calculate the precise prob¬ 
ability for a randomly selected house with the properties 
of “blue front door” and “garage” belonging to Street A 
{Hi). For this 2x2 matrix, recalling from (1551) that 


Pi + Pi = 1 , (45) 

which can easily be seen to hold. 

Symmetry.: The exchanging of rows within the contin¬ 
gency tables should not affect the calculation of Pi. 
In other words, for each i = 1,2, Pi is invariant 
under ay tA yi. This constraint implies that 

Ci(x 1 ,y 1 ,x 2 ,y 2 ) = c i (yi,x 1 ,y 2 ,x 2 ) . (46) 

Equally, if the columns are exchanged then Pi must 
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map to each other, i.e., for each i = 1,2 then Pi o 
P 2 under X\ O X 2 , yi 0 7/2 which gives the further 
constraint that 

ci(x 1 ,yi,X2,y2) = C2{x2,y2,xi,yi) ■ (47) 


Known values.: There are a number of contingency ta¬ 
ble structures which give rise to a known probabil¬ 
ity, i.e., 



Pi 

P‘2 

Di 

1 

1 

d 2 

m 

n 



Hi 

H2 

Di 

d 2 

m 

1 

n 

1 



Hi 

H‘2 

Di 

n 

m 

d 2 

m 

n 



Hi 

h 2 

Di 

n 

n 

d 2 

m 

m 



Hi 

Hi 

Di 

m 

m 

d 2 

m 

m 


Pi = 


TO 


m + n 


Pi = 


m + n 


Pi = 


Pi = 


Pi = 5 , (48) 


where to, n are positively valued probabilities. For 
such contingency tables the correct probabilities 
should alwa ys b e returned by Cj. Applying this 
principle to (|4T1) gives the constraints 

m 2ci (m, 1, n, l)y/m + m + 1 

m + n 2ci(m, 1, n, l)y/m + 2 c 2 (m, 1, n, l)\/n + m + n + 2 ’ 

(49) 


1 2ci(n, m, m, n)y/my/n + m + n 

2 2ci(n, m, m, n)y/rriy/n + 2 c 2 (n, m, m, n)y/rny/n + 2m + 2n 

(50) 


1 2ci (n, m, n, m)y/rriyjn + m + n 

2 2ci(n, m, n, m)y/rny/n + 2 c 2 (n, m, n, m)y/rny/n + 2m + 2n 

(51) 


bles, e.g., 

XI -»• 1.0, V! ->■ 1.0, x 2 ->• 1.0, 1/2 -»■ 0.50 =4> Pi RJ 0.667 , (52) 
xi ->• 0.5, y! -»• 0.5, x 2 -)■ 0.5, 1/2 -4- 0.25 => Pi « 0.667 . (53) 

While homogeneity may be justified for condition¬ 
ally independent data, this is not the case for inter¬ 
secting, co-dependent data since the act of scaling 
changes the nature of the intersections and the rela¬ 
tionship between them. This may be easily demon¬ 
strated by taking the possible value ranges for (P721) 
and (1531) . calculated using (0, which are 

Eq. ^ =►(£>! nx> 2 )^{l} , 

(D 1 rD 2 )\H 2 = {0.5} , 


Eq. m MDiDD 2 )\Hi = {0.0...0.5} , 

(D 1 fl D 2 )\H 2 = {0.0... 0.25} . (54) 

The effect of scaling has not only introduced uncer¬ 
tainty where previously there had been none, but 
has also introduced the possibility of 0 as a valid an¬ 
swer for both hypotheses. Further, the spatial dis¬ 
tance between the hypotheses has also decreased. 
For these reasons it would seem unreasonable to 
assert that (15^1) and (1551) share the same likelihood 
ratio. 


Using these principles and constraints it becomes possible 
to solve Ci. From the principle of symmetry it follows that 


ci(n, to, to, n) = c 2 (m, n, n, to) = c 2 {n, m, to, n) , 
ci (n, to, n, to) = C 2 (n, to, n, to) = c 2 (n,m,n, to) , (55) 

and that the equalities (|50l) . dim) for Pi = 0.5 automati¬ 
cally hold. Further, (1551) solves to give 


c 2 (to, l,n, 1) 


2- v /to71Ci(to, 1, 71, 1) — TO + 71 
2my/n 


(56) 


which, because ci(n, 1, to , 1 ) = c 2 {m, 1, n, 1 ), finally gives 


ci (n, 1, to, 1) 


2y / ronci(?n, 1, n, 1) — to + n 
2 m-i/n 


(57) 


Substituting g(m,n ) := y/nci(m, l,n, 1) transforms (l57l) 
into an anti-symmetric bivariate functional equation in 

TO, 71, 


g{m,n ) - g{n,m) 


m n 



(58) 


Non-homogeneity.: Bayes’ theorem returns the same 
probability for any linearly scaled contingency ta- 


whose solution is g(m , n) 
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This gives a final solution for the coefficients Ci^ of 

y/xm 


Ci{xi,y 1 ,x 2 ,y 2 ) = 
c 2 (xi,yi,x 2 ,y 2 ) = 


2x 2 y 2 

y/x 2 y 2 

2xxyi 


(59) 


Thus, substituting (l5iH) into (TUI) gives the likelihood ratio 
expression of, 


P(H 1 \D 1 nfi 2 ) = 


xiyi 

X2V2 


+ %i + yi 


+ Xi + Vi + 

X2H2 1 Xiyi 


+ X2 + 2/2 


(6 °) 

Given that the population sizes of Hi and H 2 are the 
same, no weighting of the elements needs to take place. 
Hence, the value of P{Hi\D\ (~l D 2 ) for Q may now be 
calculated to be 


may lead to situations where the process of contingency 
table updating results in the same decisions being made 
repeatedly with the appearance of an ever increasing 
degree of certainty. Indeed, from (RMl) . it would seem 
that the naive Bayes’ classifier is only a special case of a 
more complex quantum mechanical framework, and may 
only be used where the exclusivity of data is guaranteed. 

The introduction of a Hamiltonian operator, and a full 
quantum dynamical formalism, is in progress, and should 
have profound implications for cognitive psychology. In¬ 
evitably, such a formalism will require a sensible contin¬ 
uous classical limit. In other words, the final expressions 
for the likelihood ratios should contain a parameter, in 
some form of h , which, when going to 0, reproduces a 
classically known result. For example, the solutions to 
(l5Ull could be moderated as 


P(Hi\Di fl D 2 ) m 0.5896 . (61) 


ci{xi,yi,x 2 ,y 2 ) 


y/xiyi 

Zx 2 y 2 


(1 - exp (-h)) 


V. DISCUSSION 

One of the greatest obstacles in developing any 
statistical approach is demonstrating correctness. This 
formula is no different in that respect. If correctness 
could be demonstrated then, a priori, there would be 
an appropriate existing method which would negate 
the need for a new one. All that may be hoped for in 
any approach is that it generates appropriate answers 
when they are known, reasonable answers for all other 
cases, and that these answers follow logically from the 
underlying mathematics. 

However, what is clear is that the limitations of the 
naive Bayes’ classifier render any calculations derived 
from it open to an unknown margin of error. Given the 
importance of accurately deriving likelihood ratios this is 
troubling. This is especially true when these calculations 
are used to describe normative psychological theories 
[5] from which inferences are drawn as to the failure of 
human logic. 

As a quantum mechanical methodology this result 
is able to calculate accurate, iteration free, likelihood 
ratios which fall beyond the scope of existing statistical 
techniques, and offers a new theoretical approach within 
cognitive psychology. Further, through the addition of a 
Hamiltonian operator to introduce time-evolution, it can 
offer likelihood ratios for future system states with ap¬ 
propriate updating of the contingency table. In contrast, 
Bayes’ theorem is unable to distinguish directly between 
time-dependent and time-independent systems. This 


c 2 (x 1 ,y 1 ,x 2 ,y 2 ) = ip^(l - exp(—fi)) , (62) 

2x x yi 

so that in the limit of h —» 0, the intersection parameters, 
ci and c 2 , vanish to return the formalism to the classical 
situation of independent data. 


VI. CONCLUSION 

This article has demonstrated both theoretically, and 
practically, that a quantum mechanical methodology 
can overcome the axiomatic limitations of classical 
statistics in respect to their application within cognitive 
psychology. In doing so, it challenges the orthodoxy 
of de Finetti’s epistemological approach to statistics 
by demonstrating that it is possible to derive “real” 
likelihood ratios from information systems without 
recourse to arbitrary and subjective evaluations. 

While further theoretical development work needs to 
be undertaken, particularly with regards to the applica¬ 
tion of these mathematics in other domains, it is hoped 
that this article will help advance the debate over the na¬ 
ture and meaning of statistics, particularly with respect 
to cognitive psychology and decision-making theory. 
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